An empirical model of emphatic word detection

Authors

  • Milos Cernak
  • Pierre-Edouard Honnet
Abstract

The paper presents an empirical model of emphatic word detection as an alternative to conventional machine-learning-based methods. The model is based on Probabilistic Amplitude Demodulation (PAD), which is applied iteratively to obtain syllable and stress modulations, i.e., the cascaded PAD method. Emphatic words are detected from prominent peaks of the stress modulation, considering the peaks that are stressed or accented. Cascaded demodulation steered with general-purpose values derived from an average syllable duration of 200 ms yields a detection accuracy of 81%–83%. Speaker-dependent cascaded demodulation, which takes the specific speaking rate of each speaker into account, yields a detection accuracy of 86%–91%. The advantages of the proposed empirical detection model are (i) noise robustness, (ii) language independence, and (iii) no need for a training phase.
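
The cascade described in the abstract can be illustrated with a minimal sketch. It does not implement PAD: a plain rectify-and-smooth envelope follower is swapped in as a stand-in for each demodulation stage. The function names, the `stress_factor` ratio between the two time scales, the `rel_height` peak threshold, and the synthetic test signal are all assumptions made for illustration. Only the 200 ms syllable duration comes from the abstract.

```python
import numpy as np

def smooth(x, fs, window_s):
    """Boxcar-average x over a window of `window_s` seconds."""
    n = max(1, int(round(window_s * fs)))
    return np.convolve(x, np.ones(n) / n, mode="same")

def cascaded_envelopes(signal, fs, syllable_s=0.2, stress_factor=2.0):
    """Two-stage envelope extraction (a crude stand-in for cascaded PAD).

    syllable_s: assumed average syllable duration (200 ms in the abstract).
    stress_factor: hypothetical ratio of stress to syllable time scale.
    """
    # Stage 1: rectify the waveform and smooth at the syllable time scale.
    syllable_env = smooth(np.abs(signal), fs, syllable_s)
    # Stage 2: smooth the syllable envelope again at a slower time scale.
    stress_env = smooth(syllable_env, fs, stress_factor * syllable_s)
    return syllable_env, stress_env

def prominent_peaks(env, rel_height=0.6):
    """Sample indices of local maxima at least rel_height * max(env)."""
    mid = env[1:-1]
    is_peak = (mid > env[:-2]) & (mid >= env[2:]) & (mid >= rel_height * env.max())
    return np.flatnonzero(is_peak) + 1

def synthetic_utterance(fs=1000, emphasised=2):
    """Toy signal: four 'syllables' on a 150 Hz carrier, one three times louder."""
    t = np.arange(0, 2.0, 1.0 / fs)
    centres = [0.3, 0.7, 1.1, 1.5]
    gains = [3.0 if k == emphasised else 1.0 for k in range(len(centres))]
    env = sum(g * np.exp(-0.5 * ((t - c) / 0.05) ** 2) for g, c in zip(gains, centres))
    return env * np.sin(2 * np.pi * 150 * t)

fs = 1000
signal = synthetic_utterance(fs)
_, stress = cascaded_envelopes(signal, fs)
peak_times = prominent_peaks(stress) / fs  # candidate emphasis locations, in seconds
print(peak_times)
```

In this sketch, a speaker-dependent variant would amount to passing a `syllable_s` value estimated from that speaker's speaking rate instead of the general-purpose 0.2 s default.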

Similar resources

Identification of Contrast and Its Emphatic Realization in HMM based Speech Synthesis

The work presented in this paper proposes to identify contrast in the form of contrastive word pairs and prosodically signal it with emphatic accents in a Text-to-Speech (TTS) application using a Hidden-Markov-Model (HMM) based speech synthesis system. We first describe a novel method to automatically detect contrastive word pairs using textual features only and report its performance on a corp...

The detection of emphatic words using acoustic and lexical features

In this study, we describe an automatic detector for prosodically salient or emphasized words in speech. Knowledge of whether a word is emphatic or not could improve Text-to-Speech synthesis as well as spoken language summarization. Previous work on emphasis detection has focused on the automatic recognition of pitch accents. Our model extends earlier research by automatically identifying empha...

Automatic Emphatic Information Extraction from Aligned Acoustic Data and Its Application on Sentence Compression

We introduce a novel method to extract and utilize the semantic information from acoustic data. By automatic Speech-to-Text alignment techniques, we are able to detect word-based acoustic durations that can prosodically emphasize specific words in an utterance. We model and analyze the sentence-based emphatic patterns by predicting the emphatic levels using only the lexical features, and demonstr...

Acoustic Evidence of the Prevalence of the Emphatic Feature over the Word in Arabic

An acoustic study is carried out to see whether the phenomenon of pharyngalization and/or velarization is confined to the emphatic consonant and the adjacent vowels or whether it extends over the whole word in Arabic. Measurements in Hz of F1 & F2 of front unrounded vowels in monosyllabic, bisyllabic and trisyllabic words in ISA having emphatic vs. non-emphatic consonants were made. They showed signif...

Publication date: 2015